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DECLARATION OF ANDREJ SALT PH.D. UNDER 37 C.F.R, § L132 

I, Andrej Salt, Ph.D., hereby declare that: 

L I am a citizen of the United States and reside in San Francisco, 

California. My c.v. is attached as Exhibit A, 

2. I have been retained by Schering-Plough to provide my opinion on 
certain issues in the proceedings involving U.S. Patent Application No. 
09/963,347 ("the '347 application"). 

3. I have reviewed the following documents in the course of preparing 
this declaration: 

L The '347 application 

iL US. Provisional Application No. 60/101,318 ("the 318 
provisional application") 

iii, Cosenza et aL, "Disulfide Bond Assignment in Human 

Interleukin-7 by Matrix-assisted Laser Desorption/Ionization 
Mass Spectroscopy and Site-directed Cysteine to Serine 
Mutational Analysis," J, BtOL. CHEM. 272(52): 32995-33000 
(1997) 
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iv. Kroemer et a/.* ""Prediction of the three-dimensional structure 
of human mterleukin~7 by homology modeling*" Prot\ Eng. 
9(6): 493-498 (1996) 

v. Pearson, "Effective protein sequence comparison," Methods 
Enzymol. 266; 227-258 (1996) 

vi. Mott and Campbell, 'Tour-helix bundle growth factors and 
their receptors; protein-protein interactions," Cuml Opik. 
Struct, Biol, 5(1); 114-121 (1995) 

vii. Kruse et aL, "Two distinct functional sites of human interleukin 
4 are identified by variants impaired in either receptor binding 
or receptor activation," EMBO J. 12(13): 5121-5129(1993) 

viii. Park et aL t "Intermediate sequences increase the detection of 
homology between sequences "J. Mol. Biol. 273(1): 349-354 
(1997) 

ix. Rost> "PHD: predicting one-dimensional protein structure by 
profile-based neural networks," Methods Enzymol* 266: 525- 
539 (1996) 

I have worked in the field of bioinformatics since 1987 and have 
extensive experience in the fields of bioinformatics, computational 
structural biology, and predicting the structure and function of proteins 
in general. See Exh. A. I believe I am well-qualified to express an 
opinion regarding what the "318 provisional application would have 
conveyed to a person of skill in the art reading that application at the 
time it was filed 

I was asked to review the '318 provisional application to determine if a 
person of skill in the art at the time that the application was filed would 
have considered the statement relating to the ability of IL-B50 to 
stimulate or inhibit T cells and B cells to be credible in view of the 
disclosed similarities between 1L-B50 and 1L-7. 

In order to determine whether or not the disclosed sequence similarity 
between IL-B50 and DL-7 is statistically significant, I used the program 
PRS33 from the package FASTA; default values for ail program 
options were used unless stated otherwise. See, e.g. Pearson, 
Methods Enzymol, 266: 227-258 (1996); 
hUp://fasta,bioch,virgmiaxdu, PRSS3 was well known as of 
September of 1998 and was commonly used by those skilled in the art 
to evaluate the statistical significance of a similarity score between two 
given protein sequences. 

PRSS3 first calculates an optimal local alignment and the 
corresponding "observed 1 * similarity score using the Smith- Waterman 
algorithm for dynamic programming. It then calculates the distribution 
of the similarity scores for many pairs of original sequences whose 
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residues have been shuffled to make them "random/* Next, an extreme 
value distribution is fit to this "random" distribution and the fitted 
parameters are used to estimate the probability (the p~value) that by 
chance alone the original sequences have a similarity score at least as 
high as the observed similarity scorn. In addition, the p-value is 
presented in a modified form as an E-value, which gives the number of 
matches with a score at least as high as the observed similarity score 
that would have arisen by chance when one of the original sequences is 
compared against a database containing a specified number of 
unrelated sequences. It is generally accepted that E-values smaller 
than 0.02 essentially prove homology. See, e.g. Pearson, METHODS 
ENZYMOL. 266: 227^258, 256 (1996), 

8. Using the PRSS3 program, I found that the sequence similarity 
between and IL-B50 is statistically significant when the 
comparison is made using an appropriately chosen database of human 
proteins known in 1998. In 1998, the SwissProt database contained 
approximately 60,000 proteins, Currently, SwissProt contains 270,778 
proteins, of which 16,619 (or 6. 1%) are human (6/15/07). See 
http://ca,expasy.org/sprot/refootes/relstat. Assuming the proportion of 
the human sequences in SwissProt in 1998 was the same as today 
6,1%), I estimated the number of human proteins in SwissProt in 1998 
to be approximately 3,600. This is a reasonable estimate, considering 
the human genome sequencing efforts did not start producing human 
protein sequences in earnest until 1999, 

9. The E-values (a parameter describing the number of hits that one can 
"expect" to see by chance when searching a database of a particular 
size) between DL-B50 and human and murine IL-7 when searching 
only human proteins as of September 1998 are approximately 0J6 and 
0 + 01, respectively. The latter E- value is statistically significant While 
the E- value of 0.16 between the human IL-B50 and human IL-7 is not 
considered to guarantee homology on its own, it still indicates 
homology per generally accepted standards. See, e.g. Pearson, 
Methods Enzymol. 266: 227-258, 256 (1996). Moreover, because 
there is unequivocal homology between human EL-7 and murine IL-7 
as demonstrated by the statistically significant sequence similarity 
score of 0.01, there must also be homology between the human IL-B50 
and human IL-7. This kind of transitive logic is generally accepted in 
detecting homology based on an intermediate sequence search 
approach. See, e.g. Park et aU J* MOL, BXGL. 273: 349-354 (1997). I 
note that even if there were twice as many known human protein 
sequences as estimated here, that fact would not have changed the 
statistical significance of the observed similarity scorns. Therefore, my 
conclusions are not dependent on a precise estimate of the number of 
human protein sequences known in 1998. 

10. The version of PRSS3 that is currently available is not exactly that 
same as the version available in 1998 T however they both use the same 
algorithms for calculating the optimal alignment and the corresponding 
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E- value. Therefore, to the best of my knowledge* the E- values 
calculated with the version of the PRSS3 program that existed in 1998 
would be essentially the same as the E- values I calculated using the 
currently available program. 

11. I additionally compared the predicted secondary structures of EL-7 and 
IL-B50. The secondary structures were predicted by the program PhD, 
which was available in 1998, See Rost, Methods ENZYMOL. 266: 
525-539 {1996), The predicted secondary structures of IL-7 and IL~ 
B50 suggest that the two proteins indeed have the four helices typical 
of the cytokine 4-helix bundle. This finding is substantiated by 
independent predictions for BL-7 made by others and published before 
1998. See Kroemer etal, P&OT. ENG. 9: 493-498 (1996). Further, the 
IL-7 regions found (by site-directed mutagenesis) to be important for 
binding other proteins, including the receptor, are also the regions 
where the similarity to BL-B50 is highest, including helices A and D as 
well as loops A-B and C~D, See, e.g. Mott and Campbell, Curr. Opin. 
Struct. Biol. 5(1): 114-121 (1995); Kruse etaU EMBO J. 12(13): 
5121-5129 (1993); '318 provisional application at p. 1L 

12. Further, IL-B50 and IL-7 share conserved short sequence patterns 
indicating PKC phosphorylation, Asn glycosylation, and myristilation 
sites. The proteins also share one approximately conserved pattern 
indicating protein kinase CK2 phosphorylation. These functional 
sequence motifs were well characterized within the art prior to 
September 1998. 

13. Based on: (i) the statistically significant similarity of the IL-7 and IL- 
B50 sequences, (ii) the similarity between the predicted secondary 
structures of IL-7 and DL-B5G* (iii) the higher similarity in known 
functionally important regions compared to the rest of the sequence, 
and (iv) the conserved short functional sequence motifs* if I were 
reading the '318 provisional application at the time that it was filed, I 
would have believed that IL-B50 and IL-7 have similar sequences and 
biological functions. Accordingly, I would have found the utility 
asserted in the '318 provisional application to be credible. 



I declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or 
both, under Section 1001 of Title 18 of the United States Code and that such 
will false statements may jeopardize the validity of the patent application or 
any patent issuing thereon. 




6/19/07 



Andrej Sah, Ph D. Date 
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EXHIBIT A 



Curriculum vita* 



Name: Andrej Sail 



Position: Professor, Step V 

Department of Biopharmaceutical Sciences 
Department of Pharmaceutical Chemistry 

California Institute for Quantitative Biomedical Research (QB3) 
School of Pharmacy 



Integrative Program in Quantitative Biology: 

Bioinformatics and Medical Informatics Graduate Program 
Biophysics Graduate Program 
Graduate Program in Complex Biological Systems 
Chemistry and Chemical Biology Graduate Program 
Pharmaceutical Sciences and Pharmacogenomics Graduate Program 

Address: Byers Hall, Suite 5G3B, MC 2552 

University of California at San Francisco 
1700 4 th Street 

San Francisco, CA 94158-2330 



Voice: +1-415-514 4227 

Fax:+1-415-514 4231 
email: sali{£).salilab.org 
web: http://salilab,org 



EDUCATION 



1983-1987 University of Ljubljana, Slovenia BSc Chemistry 
1987-1991 University of London, UK PhD Molecular Biophysics 



PRINCIPAL POSITIONS HELD 



1 987-1 99 1 PhD student. Department of Crystallography, Birkbeck College, and Imperial 
Cancer Research Fund, London, UK (Mentor: Prof Thomas L. Blundell, FRS), 

1991-1994 Postdoctoral Fellow at Department of Chemistry, Harvard University, 
Cambridge, USA (Mentor: ProL Martin Karplus). 



1995-2000 Assistant Professor, Head of Laboratory. The Rockefeller University. 
2000-2003 Associate Professor, Head of Laboratory. The Rockefeller University, 
2003- Professor in Departments of Biopharmaceutical Sciences and Pharmaceutical 

Chemistry, and California Institute for Quantitative Biomedical Research. 

University of California at San Francisco. 



OTHER POSITIONS HELD CONCURRENTLY 



2003- Vice Chair of Computational Biology in Dept. of Biopharmaceutical Sciences, 

University of California at San Francisco. 



HONORS, AWARDS, AND FELLOWSHIPS 



1984-1987 Undergraduate scholarship from J, Stefan Institute, Ljubljana, Slovenia. 

1985 British Council Visiting Student at Birkbeck College, London, UK. 

1 987 British Council Visiting Student at Birkbeck College, London, UK. 

1987-1990 Overseas Research Students Award from the Committee of Principals and Vice 
Chancelors, England. 

1987-1988 Scholarship for graduate studies from the Research Council of Slovenia. 

1989-1 990 M erck Sharp & Dohm academic scholarship , 

199L 1994 Jane Coffin Childs Memorial Fund for Medical Research Postdoctoral Fellow. 

1 996- 1 999 Alexandrine and Alexander L. Sinsheimer Scholar. 

1998-2000 Alfred R Sloan Research Fellow. 

2000-2003 Irma L Hirschl Career Award Scientist. 



KEYWORDS/AREAS OF INTEREST: 



We employ the laws of physics and the rules of evolution to develop and apply methods for: 

• predicting the structures of proteins; 

• determining the structures of macromolecuiar assemblies; 

• annotating the functions of proteins and their assemblies using their structures. 



SOFTWARE 



1993 MODELLER, a program for comparative protein structure modeling by satisfaction 
of spatial restraints; licensed to Accelrys Inc. since 1994. 



2000 MODPIPE, a program for large-scale comparative protein structure modeling; 

licensed to Accelrys Inc. and Structural Genomix Inc. (2000-2004) 
2000 MODBASE, a comprehensive database of comparative protein structure models; 

licensed to Structural Genomix Inc. (2000-2004) 



PROFESSIONAL ACTIVITIES 

Member of professional societies: 

* Protein Society. 

Organization of meetings: 

* Program Committee, Georgia Tech IntL Conference on Bioinformatics, Atlanta GA. 1999- 
200 L 

* Program Committee, Math/Chem/Comp 2001, Dubrovnik, Croatia. June 25-30, 200 L 

* Program Committee, Protein Society 19* Symposium in Boston, July 2005. 

* Program Committee, Protein Structure Modeling Workshop, Rutgers University, NJ, 
November 2005. 

* Program Committee, American Society for Biochemistry and Molecular Biology, April 
2006. 

* Organizer, World Molecular Engineering Network, annual TSRI & UCSF meeting in San 
Jose del Cabo, Mexico, 2006- . 

* Program Committee, 4th Conference on Modeling of Protein Interactions - MPI 2007 

Service to professional publications: 

* Editor, Structure. 

* Member, Editorial Board, PLOS Computational Biology; Journal of Computer Aided 
Molecular Design; Molecular and Cellular Proteomics; Protein Engineering, Design, and 
Selection, 

* Section Head for the Structural Genomics section on Bio Med Central 

* Reviewer for Nature, Science, Cell, Proc.NatLAcad.ScLUSA,, Nat, Struct. MoL Biol, 
Nat.Gen., NatBiotech., Structure, J, MoLBioL, Proteins, ProtEng,, Prot.ScL, 
Bioinformatics, NucLAcids Res., J.BioLChem., BMC Structural Biology, Genome 
Biology, FEBS Letters, J.Com.Aid.MoLDes., Biophysical Journal, Biochemical Journal, 
PLoS Biology, and PLoS Computational Biology. 

Government and other professional service: 

* Executive Committee of the Protein Society (2005-). 

* Advisory Committee of the Protein Data Bank (2005 ~) 

* NIH/GM MSF-B Study Section (successor of BBCA) (2004-). 



• Reviewer for NIH, NSF, DOE, Burroughs Wellcome Fund, European Community, and 
BSF grant proposals. 

INVITED PRESENTATIONS (since 1999) 

1. Structural Genomics Targets Workshop, NIH, Washington DC, USA. February 11, 1999. 

2. Advances & Opportunities at the Biology/Math/Computational/Physical Sciences Interface, 
Rutgers University, New Brunswick, New Jersey, USA. March 6, 1999. 

3. Mount Sinai School of Medicine, New York, New York, USA. March 19, 1999. 

4. New York Structural Biology Group, New York Academy of Sciences, New York, New 
York, USA. March 24, 1999. 

5. Columbia University, New York, New York, USA, 1999. 

6. Chemistry Dept., New York University, New York, New York, USA, 1999. 

7. CERCA CADD Symposium, Montreal, Canada. April 13, 1999. 

8. Protein Sequence Structure Function Meeting, UCSF, San Francisco, California, USA April 
23, 1999. 

9. Data Mining in Crystallography, Erice, Italy. May 15, 1999. 

10. Structural Biology Net, Tallberg, Sweden. June, 1999. 

1 1. The Scripps Institute, La Jolla, California, USA. August 13, 1999. 

12. Frontiers in Structural Biology, Indian Institute of Science, Bangalore, India. August 27, 1999. 

13. Mathematical Problems in the Molecular Sciences, Courant Institute, New York, New York, 
USA. October 10, 1999. 

14. City College of New York, New York, USA. October 20, 1999. 

15. Agouron Pharmaceuticals, San Diego, California, USA. October 28, 1999. 



16. Second International Georgia Tech Conference in Bioinformatics, Atlanta, Georgia, USA. 
November 12, 1999. 

17. Structural Genomics Conference, ANL, Chicago, Illinois, USA, November 16, 1999. 

18. Structural Genomics and the Pharmaceutical Industry, Princeton, New Jersey, USA. 
November 18, 1999, 

19. BRI, Montreal, Canada. November 24, 1999. 

20. University of Toronto, Toronto, Canada. December 9, 1999, 

2L Quantitative Challenges in the Post Genomic Sequence Era, La Jolla Interfaces in Science, San 
Diego, California, USA. January 12, 2000, 

22. Japan Biophysical Society Meeting, Tokyo, Japan. January 17, 2000. 

23. UCSD, Dept of Physics, San Diego, California, USA, January 19, 2000. 

24. UCSF, San Francisco, California, USA. January 20, 2000. 

25. Center for Physics and Biology, Rockefeller University, New York, New York, USA. January 
24, 2000. 

26. Biological Chemistry Seminar Series, University of Penn, Philadelphia, Pennsylvania, USA. 
February 17, 2000. 

27. ABRF 2000 ^From Singular to Global Analyses of Biological Systems", Bellevue, 
Washington, USA. February 22, 2000, 

28. AAAS conference, Washington DC, USA. March 20, 2000. 

29. Keystone Symposium on Macromolecular Assemblies at Work: Application of Physics, 
Chemistry, and Mathematics to Biology, Durango, Colorado, USA. March 25, 2000. 

30. Bio2000 ; Boston, Massachusetts, USA, March 28, 2000, 

3 1 . Bioinformatics 2000, Elsinore, Denmark. April 28, 2000, 

32. Computational Challenges of the Post Genomic Age, SUN, San Francisco, California, USA. 



May 12, 2000. 

33. Btopolymers Gordon Conference, Newport, Rhode Island, USA. June 18-22, 2000. 

34. 2000 FASEB Summer Research Conference on Protein Folding in the Cell, Saxton River, 
Vermont, USA. July 22-27, 2000. 

35. Monsanto/Pharmacia lectureship series, Univ. of Saint Louis, Missouri, USA. September 28, 
2000. 

36. Workshop on Structural Genomics. NIGMS, Washington DC, USA. October 23, 2000, 

37. Genomics and Bioinformatics, UMD, New Brunswick, New Jersey, USA, November 2, 
2000. 

38. University of Minnesota, Minneapolis, Minnesota, USA. November 27, 2000. 

39. Oncogenomics: Dissecting Cancer Through Genome Research, Nature Genetics, Tuscon, 
Arizona, USA, January 25-27, 2001. 

40. UAB, Birmingham, Alabama, USA. February 26, 2001. 

41. Structural Genomics and Bioinformatics, Instituto Juan March, Madrid, Spain. March 12-14, 
2001. 

42. Bard College, New York, USA, April 18, 2001- 

43. Physics/Chemistry. CSUN, Northridge, California, USA. May 2, 2001- 

44. Annual meeting of the Canadian Society for Biochemistry and Molecular and Cellular Biology 
(CSBMCB), Toronto, Canada, May 31-June 3, 2001. 

45. Math/Chem/Comp 2001, Dubrovnik, Croatia. June 25-30, 2001. 

46.4th International Conference on Biological Physics, ICBP2001, Kyoto, Japan. July 30- 
August 3, 2001. 

47. ACS Meeting, Chicago, Illinois, USA. August 26-30, 200 L 

48, 4th International Conference on Molecular Structural Biology, ICMSB2001, Vienna, Austria. 



September 5-9, 200 1 

49. University of Maryland, Maryland, USA, October 2, 2001. 

50. Columbia University, New York, New York, USA. October 15, 200 L 

5 1 . Genomics & Proteomics meeting, Barcelona, Spain. October 1 9, 200 1 > 

52. Mast Cell Workshop, Bethesda, Maryland, USA. November 26-30, 2001. 

53. University of Zuerich, Zuerich, Switzerland. December 5, 200 L 

54. Bioinformatics & Proteomics: From Sequence to Function, Lausanne, Switzerland December 
6, 200L 

55. Genomics Seminar Series, SkirbatI Institute, New York, New York, USA. February 6, 2002. 

56. Mining the Human Genome for New Drug Discovery - New Ways of Handling Orphan 
Targets. NYAS, New York, New York, USA. February 26, 2002, 

57. Biological Processes for New and Innovative Engineering Systems and Applications, ARO 
workshop, Research Triangle Park, North Carolina, USA, February 26*27, 2002. 

58. New York City Blood Centre, New York, New York, USA. March 7, 2002. 

59. Proteomics- The New Frontiers: Discovery, Separation, Prediction & Modeling, University of 
Delaware, Newark, Delaware, USA. March 14-15, 2002, 

60. Harvard University, Cambridge, Massachusetts, USA, March 28, 2002. 

61. Molecular Cell Biology and Biochemistry Seminar Series, Virginia Tech, Blacksburg, Virginia, 
USA. April 5, 2002. 

62. UCSF, San Francisco, California, USA, April 15, 2002. 

63. A Workshop on large biological structures, Asiiomar, California, USA. April 2G-22 ; 2002. 

64. SCBMB Program, Baylor College of Medicine, Houston, Texas, USA. May 15, 2002. 

65. University of Barcelona, Barcelona, Spain. May 21, 2002. 



66. Genomics & Proteomics meeting, Barcelona, Spain. May 22, 2002, 

67. Samuel Lunenfeld Research Institute, Toronto, Canada. May 29, 2002, 

68. 50th ASMS conference American Society of Mass Spectrometry, Orlando, Florida, USA. 
June 2-6, 2002. 

69. The 5th Summer Session of the New York Structural Biology Discussion Group , Cold 
Spring Harbor Laboratory, New York, USA. June 26, 2002. 

70. Berkeley-Stanford summer school for protein crystallography, SSRL, Stanford, California, 
USA, July 8*12, 2002. 

71. Gordon Conference on Diffraction Methods in Structural Biology, Connecticut College, New 
London, Connecticut, USA. July 14-19, 2002. 

72. The 19th Congress and General Assembly of the International Union of Crystallography 
IUCR, Geneva, Switzerland. August 6-15, 2002. 

73. NYCBS New York Computational Biology Society seminar, NAS, New York, New York, 
USA, September 18, 2002. 

74. Center for Biological Modeling, Michigan State University, East Lansing, Michigan, USA. 
September 27, 2002. 

75. Bioinformatics seminar, Texas A&M University, Tamu, TX, USA. November 7, 2002. 

76. Genomics and Bioinformatics Center Inaugural Symposium, Pontificia Universidad Catolica, 
Santiago, Chile. November 18-20, 2002. 

77. Structure and Function of the Proteome, Argonne National Laboratory, Argonne, Illinois, 
USA. November 23-24, 2002. 

78. Keystone Symposium in Proteomics: Technologies and Applications, Keystone Resort in 
Keystone, Colorado, USA, March 25-30, 2003. 

79. NCRR sponsored Workshop on Structural Proteomics of Complexes, Bethesda, Maryland, 
USA. April 7-8, 2003. 



80. American Society for Biochemistry and Molecular Biology meeting, San Diego, California, 
USA. April 11-15,2003. 

8L St. Jude Children's Research Hospital, Memphis, TN, USA. April 22, 2003, 

82. Genentech, Inc., South San Francisco, CA, USA. April 29, 2003. 

83. Fourteenth Annual World Molecular Engineering Network (WMEN) Conference, San Jose 
del Cabo, Baja California Sur, Mexico. May 4-8, 2003. 

84. Structure and Function of Proteome, SBC, Argonne National Laboratory, Argonne, Illinois, 
USA, Spring, 2003, 

85. "Frontiers of Bioinformatics" symposium, Center of Excellence in Bioinformatics, University 
at Buffalo, Buffalo, New York, USA. June 6-8, 2003. 

86. IBM Thomas J, Watson Research Center, New York, New York, USA. June 11, 2003. 

87. 2003 Gordon Research Conference on 3D Electron Microscopy of Macromolecules, Colby 
Sawyer College, New London, New Hampshire, USA. June 22-26, 2003. 

88- PS1 workshop ond data management, NIH Campus, Bethesda, MD, USA. July 10-11, 2003. 

89. GTL and Beyond: Data and Computational Needs Workshop, San Francisco, CA, USA, 
September 10-11,2003. 

90. 2003 Pharmaceutical Sciences and Pharmacogenomics Retreat, Marshall, CA, USA. 
September 11-13, 2003. 

9L Structure and Chemistry Seminar at Scripps, San Diego, CA, USA. September 18, 2003. 

92. 5th Meeting of the Slovenian Biochemical Society, Ljubljana, Slovenia, September 24-28, 
2003. 

93. Seminar at Northeastern University, Boston, MA, USA. October 6, 2003 

94. Workshop on Visualization of Biological Complexes, Four Points Sheraton Hotel, Emeryville, 
San Francisco Bay Bridge, CA, USA, October 11-12, 2003 

95. Seminar at PARC, Palo Alto, CA, USA. October 15, 2003, 



96. NIGMS Homology Modeling Workshop, Bethesda, MD, USA, October 21-22, 2003. 

97. Seminar at Purdue University, West Lafayette, IN, USA. October 24-25, 2003. 

98. PSI Target Selection Workshop, Bethesda, MD, USA. November 13-14, 2003. 

99. Biophysics/CCB Retreat, Asilomar Conference Center, Pacific Grove, CA, USA. December 
7-9, 2003, 

100. University of Cologne, Cologne, Germany. January 19, 2004, 

101. Ringberg meeting, Schloss Ringberg, Germany. January 21-23, 2004, 

102. Licensing Executives Society meeting, San Francisco, CA, USA. February 12, 2004. 

103. The Structural, Functional and Evolutionary Gordon Conference, Four Points Sheraton 
Harbortown, Ventura, CA, USA. February 15-20, 2004. 

104. Seminar at Berkeley, Berkeley, CA, USA, March 8, 2004, 

105. Seminar at UCSC, Santa Cruz, CA, USA. March 11, 2004. 

106. Workshop on Structure Determination of Macromotecular Machines and Assemblies by 
Hybrid Methods, Granlibakken/Lake Tahoe Conference Center, CA, USA. March 17-20, 
2004. 

107. Workshop on Structural Determination of Environmentally Responsive Gene (ERG) 
Products for Diagnostics & Drug Discovery (NIEHS/DERT), Snowbird Resort, Snowbird, 
Utah, USA April 12-13, 2004. 

108.2004 Keystone Symposium on Structural Genomics, Snowbird Resort, Snowbird, Utah, 
USA April 13-19, 2004. 

109. BayGenomics PGA, San Francisco, CA, USA. April 27, 2004. 

1 10. Fourteenth Annual World Molecular Engineering Network (WMEN) Conference (2004), 
San Jose del Cabo, Baja California Sur, Mexico, May 2-6, 2004. 

1 1 L Gladstone Scientific Retreat, Asilomar in Monterey County, CA, USA. May 18-20. 2004. 



1 12. Seminar at Caltech, Pasadena, CA, USA. October 12, 2004, 

113. EMBO conference on Structures in Biology, EMBL, Heidelberg, Germany. November 10- 
13, 2004. 

1 14. 1CSG 2004 Meeting, Washington, DC, USA. November 17-24, 2004. 

115. Workshop of the Center of Protein Folding Machinery, Stanford University, CA, USA, 
December 4-5, 2004. 

116. Biological and Medical Informatics/Biophysics/Chemistry and Chemical Biology graduate 
groups retreat, Asilomar Conference Center, Pacific Grove, CA, USA. December 5-7, 2004. 

117. The 7th World Congress of the World Association of Theoretically Oriented Chemists 
(WATOC), Capetown, South Africa. January 16-21, 2005. 

118. ABRF meeting, Biomolecular Technologies: Discovery to Hypothesis, Savanah, Georgia, 
USA. February 5-8, 2005. 

119. Speaker at Frontiers in Computational Biophysics Symposium, N1H campus in Bethesda, 
MD, USA. April 29-30, 2005. 

120. NIH Symposium on Structural Analysis of Large Assemblies: Sizing up the Challenges, 
NIH campus in Bethesda, MD, USA. June 2-3, 2005. 

121. Speaker at SRTs Computational Biology series, SRI International, Menlo Park, CA, USA. 
June 29, 2005. 

122. 19th Annual Symposium of the Protein Society, Boston, MA, USA, July 30 - August 3, 
2005. 

123. GRC 2005 Computer-aided design meeting, Tilton School, NH, USA. July 31 - August 5, 
2005. 

124. Keynote Speaker in XX lUCr Congress in Firenze, Italy. August 23-3 1, 2005. 

125. Seminar at the Biochemical and Biophysical Methods Course Fall 2005, The Rockefeller 
University, New York, NY, USA. October 1 1, 2005. 

126. Seminar at the Novartis Institutes for BioMedical Research, Cambridge, MA, USA. 



November 9, 2005. 

127. Workshop on Biological Macromolecular Structure Models, The State University of New 
Jersey, Piscataway, NJ, USA. November 19-20, 2005. 

128. Speaker at the International Workshop M2CELL, The Royal Abbey of Fontevraud, Paris, 
France. December 4-6, 2005. 

129. Organizer and Speaker of the Theme "Macromolecular Structure and Dynamics" with 4 
Symposia. ASBMB 2006 meeting, San Francisco, CA, USA. April 1-5, 2006. 

130. Organizer and Speaker of the World Molecular Engineering Network Conferenc. Cabo San 
Luca, Mexico. April 30-May 2nd, 2006. 

131. Plenary Speaker at the 11th Sympsosium on Recent Advances in Biophysics, National 
Taiwan University, Taipei, Taiwan. May 23-26, 2006 

132. Seminar at UC Davis, CA, USA. June 1, 2006. 

133. 2006 Keystone Symposium on Multi-Protein Complexes Involved in Cell Regulation, St. 
John's College, Cambridge, UK. August 18-23, 2006. 

134. Symposium at Wyeth Research, Cambridge, MA, USA. October 16, 2006. 

135. Seminar at the University of Massachusetts, Dept. of Biochemistry and Molecular 
Pharmacology, Worcester, MA, USA. October 18, 2006. 

136. Seminar at the Fifth Annual Systems Biology Course at the Institute for Systems Biology, 
Seattle, WA, USA. November 9, 2006. 

137. Seminar at UC Merced Center for Computational Biology, Merced, CA, USA. November 
30, 2006. 

138. Speaker at the TDR/WHO Drug Target Selection Meeting in Seattle, OR, USA. December 
1 st , 2006. 

139. Speaker at the Biological and Medical Informatics/Biophysics/Chemistry and Chemical 
Biology Graduate Groups Retreat, Monterey, CA, USA. December 3-5, 2006. 

140. Speaker at the Collaborative Drug Discovery meeting, UCSF, San Francisco, USA. March 
1 st 2007. 



141. Organizer and Speaker of the World Molecular Engineering Network Conference. Cabo San 
Lucas, Mexico. April 29-May 2nd, 2007. 

142. Invited Speaker at the Protein Complexes and Protein Networks Symposium in Martinsried, 
Germany, May 21-22, 2007. 

143. Speaker at The Protein Folding Center Annual Retreat, Stanford, CA, USA. May 27-29 
2007. 

144. Invited Speaker at the Center for Theoretical Biological Physics. UCSD,San Diego USA. 
June 1st, 2007. 

145. Speaker and Organizer of the Modeling of Protein Interactions Meeting, MPI-2007, 
Lawrence, KA, USA, September 30-October 2, 2007. 

146. Invited Speaker at the Keystone Symposium on Structural Genomics and Its Applications to 
Chemistry, Biology and Medicine, Steamboat Springs, Colorado, CO, USA, January 6-11 
2008. 

147. 40th Course: From Molecules to Medicines Integrating Crystallography in Drug Discovery, 
Erice, Italy. May 29 - June 8, 2008. 



UNIVERSITY COMMITTEES 

• Faculty steering committee for the shared computer cluster at QB3, Chair (2006) 

• Basic Sciences Research Resources Oversight Committee, UCSF (2005-) 

• Rock Hall Governance Committee, UCSF (2005 -) 

• QB3 Executive Committee (2004-) 

• QB3 Governance/Community Committee, Chair (2005-) 

• Mission Bay leadership committee, UCSF (2005-) 

• BPS Internal Advisory Committee, UCSF (2003-) 

• Biophysics Graduate Program Executive Committee, UCSF (2003-) 

• Systems Biology Faculty Search Committee (2005-) 

• MD/PhD Scientist at QB3 Search Committee, UCSF (2003-) 

• Bioinformatics and Medical Informatics Graduate Program Executive and Admissions 
Committee, UCSF (2003-) 

• Information Technology at School of Pharmacy Committee, UCSF (2003-) 

• QB3 Building Committee, UCSF (2003-2005) 

• Pharmaceutical Sciences and Pharmacogenomics Graduate Program Admissions 
Committee, UCSF (2003-2005) 



* School of Pharmacy Strategic Planning Committee, UCSF (2005) 

* Chancellor's Council committee, UCSF (2004) 

* Academic Information Technology Coordinator search committee, UCSF (2004) 

* Future of Computing at UCSF Committee (2003) 

* Bioinformatics and Computational Biology Faculty Search Committee, Co-Chair, UCSF 
(2003-2004) 

* Bioinformatics and Computational Biology Faculty Search Committee, UCSF (2002- 
2003) 

* Ad Hoc Faculty reviews (-15 since 2003) 

* Computer Security Committee, Chair, Rockefeller University (2000). 

* Faculty Awards Nominations Committee, Rockefeller University (2000-2002) 

* Bioinformatics Search Committee, Rockefeller University (2000-2002) 

* Dean's Graduate Studies Admissions Committee, Rockefeller University (2000-2002) 

* Chemistry Search Committee, Rockefeller University (1995-2000) 



TEACHING 

* Instructor (with Prof. Patsy Babbitt and Tanja Kortemme) for a yearly one semester 
graduate course "Bioinformatics 5 ' BM1-206 at UCSF, since 2004. 

* Instructor (with Prof. Patsy Babbitt) for a yearly one semester SOP course 
"Bioinformatics" BPS-l 14 at UCSF, since 2005. 

* Invited lecturing at several other courses and seminar series at UCSF (Bioengineering, 
M acromolecules) . 

* Chemistry and Chemical Biology Graduate Program Faculty member at UCSF, since 
2004. 

* Instructor for a one semester graduate course "Analysis and prediction of protein 
structures" at The Rockefeller University, in 1998, and 2000, 

* Guest lecturer at courses and workshops at Rockefeller University (1995-2002), Weill 
Medical College of Cornell University (1999-2002), New York University (2000-2002), 
Crystallography School in Erice, Italy (1999), FEBS course in Barcelona, Spain (1990), 
and Stanford University (2002), 



PREDOCTORAL MENTORING: 

Current graduate students: 

* Mark Peterson, BMI (shared with Prof Patsy Babbitt) (6/1/2003) 

* Michael Kim, BMI (6/1/2003) 

* Ranyee Chiang, BMI (shared with Prof. Patsy Babbitt) (9/1/2002) 



• Libusha Kelly, BMI (6/1/2003) 

• David Eraraian, Biophysics (8/1/2004) 

• Keren Lasker (shared with Prof. Haim Wolfson, Tel-Aviv University) 8/1/2006) 

• David Barkan, BMI (9/1/2006) 

Past graduate students: 

• Roberto Sanchez (8/20/1995-10/6/2000); Assistant Professor at Mount Mount Sinai 

School of Medicine, NY, since 10/9/2000 

• Eric Feyfant (1999-2001; postdoctoral fellow 2/10/2001-1 1/1/2001), scientist at Genetics 
Institute Inc. (now Wyeth Inc.), Cambridge, MA, since 1 1/2/2001 

• Bino John (8/1/2000-4/19/2003); Assistant Professor at The University of Pittsburgh 
School of Medicine, since 9/1/2005 

• Nebojsa Mirkovie (4/1/1999-10/1/2003); postdoctoral fellow with Diana Murray at Weill 
Medical College of Cornell University, NY, since 2/1/2004 

• Fred Davis, Biophysics (1/1/2003-3/1/2007) 



POSTDOCTORAL MENTORING: 

Current Postdoctoral Fellows: 

• Ursula Pieper (2000) 

• Narayanan Eswar (2000) 

• Mallur S. Madhusudhan (3/1/2000) 

• Frank Alber (04/1 5/200 1) 

• Dmitry Korkin (9/1/2002) 

• Min-yi Shen (2/1/2003) 

• Ben Webb (9/1/2003) 

• Friedrich Foerster (7/1/2005) 

• Hao Fan (9/1/2006) 

• Javier Velazquez (11/13/2006) 

• Daniel Russel (02/0 1/2007) 

Past Postdoctoral Fellows: 

• Ilya Vakser (1995-1997), Assistant Professor at University of South Carolina at 
Charleston until 2002; currently Associate Professor at SUNY Stony Brook 

• Andras Fiser (9/15/1997-12/31/2002), Assistant Professor at Albert Einstein College of 
Medicine, Bronx, NY, since 1/2/2003 

• Ash Stuart (7/1/1999-3/2003), Assistant Professor at Ramapo College, Mahwah, NJ. 

• Azat Badretdinov (1996-1999); senior scientific programmer at Accelrys Inc., San Diego, 



since 1999 

• Francisco Melo (9/1/1 998-1/1/2001), Assistant Professor at Pontificia Universidad 
Catolica de Chile, since 1/1/2001 

• Bozidar Yerkovich (2002-2003), Head of Structural Bioinformatics at Rosetta 
Inphannaiics Inc., Seattle, since 2003 

• Valya Ilyin (2000-2002), Assistant Professor at Northeastern University, Boston, since 
2002 

• Marc A. Marti-Renom (2/1/1999-5/31/2006); Adjunct Assistant Professor at UCSF, 
since 2/1/2003; Assistant Professor at Prince Felipe Research Center, Valencia, Spain, 
since 7/1/2006 

• Rachel Karchin (8/1/2003-8/31/06); Assistant Professor at Johns Hopkins University 

• Damien Devos (11/1 5/2002-6/30/06); at EMBL, Heidelberg. 

• Maya Topf (2/1/2003-10/30/06); Lecturer in Dept. of Crystallography, Birkbeck College, 
London, since 11/1/06. 

• Niu Huang (12/1/2003) 

• Andrea Rossi, Senior Scientist at Rinat Laboratories, Pfizer inc (1 1/1/2000-1 1/2006) 



FACULTY MENTORING 

* Assigned faculty mentor for Tanja Korternme (BPS) 



DOCTORAL DISSERTATION AND ORAL QUALIFYING EXAMINATION 
COMMITTEES (since 2004) 



2004 Tiba Ayunechi (Doctoral Dissertation) 

2004 Barbara Novak, BMI (Orals) 

2004 Alan Graves, Biophysics (Orals) 

2004 Mike Kim, BMI (Thesis Committee) 

2004 Alexandra Schnoes (Orals) 

2005 Ben Sellers, Biophysics (Orals) 

2005 Fred Davis, Biophysics (Thesis Committee) 

2005 Nathan Salomonis, PSPG (Orals) 

2005 Jerome Nilmeir, Biophysics (Orals) 

2005 Libusha Kelly, BMI (Thesis Committee) 

2005 Tuan Pham, BMI (Orals) 

2005 Ranyee Chiang, BMI (Thesis Committee) 

2005 Mark Peterson, BMI (Thesis Committee) 

2006 Marco Sorani, BMI (Orals) 



2006 
2006 
2006 
2006 
2006 
2006 
2006 
2006 
2006 
2007 
2007 
2007 



David Lomelin, BMI (Orals) 

Nima Fayazmanesh, Biophysics (Orals) 

Arjun Narayanan, Biophysics (Orals) 

David Eramian, Biophysics (Thesis Committee) 

Veena Thomas, PSPG (Orals) 

Dale Webster, BMI (Orals) 

Holly Atkinson, BMI (Orals) 

Dan Mandel, BMI (Orals) 

Mike Keiser, BMI (Orals) 

Elisabeth Humphris (Orals) 

Colin A. Smith (Orals) 

Greg Friedland (Thesis Committee) 



SUMMARY OF TEACHING HOURS: 

2003- 04: 50 hours of teaching. 

Formal class or course teaching hours: 18 hours. 
Informal teaching hours: 4 hours. 
Mentoring hours: 500 hours. 

2004- 05: 100 hours of teaching. 

Formal class or course teaching hours: 30 hours. 
Informal teaching hours: 6 hours. 
Mentoring hours: 500 hours. 

2005- 06: 100 hours of teaching. 

Formal class or course teaching hours: 30 hours. 
Informal teaching hours: 6 hours. 
Mentoring hours: 500 hours. 

2006- 07: 100 hours of teaching. 

Formal class or course teaching hours: 30 hours. 
Informal teaching hours: 6 hours. 
Mentoring hours: 500 hours. 

2007- 2008 : Anticipated total teaching load is expected to be similar to that in the previous year. 

Since my arrival to UCSF in January 2003, 1 joined Prof. Patsy Babbit in leading the graduate and 
professional students' courses in Biomformatics (BMI-206 and BPS- 114). Patsy and I also give 
many of the lectures and supervise student activity associated with the course (student seminars, 
exercise sets). I also participate as a guest lecturer in a number of other courses, such as 
Macromolecules. 



CONSULTING 



• Founder and consultant, Prospect Genomics Inc., San Francisco, now Structural Genomix 
Inc., San Diego (1999 to 2004). 

• Consultant, Acceirys Inc., San Diego (1994 to date). 

• Consultant, Biogen Inc., Cambridge, MA, 2000-2002. 

• Consultant, Millenium Inc., Cambridge, MA, 2001-2002. 

• Consultant, Wyeth Inc., Princeton, NJ, 2006. 



RESEARCH AWARDS AND GRANTS 



07/01/96 

09/01/96 

10/01/96- 

10/01/98- 
07/01/98- 
09/01/99- 

07/01/00- 
01/01/00- 

01/01/00- 

02/01/00- 
02/01/00- 
09/30/00 - 
06/01/01 - 
07/01/01 - 
07/03/01 



- 06/30/01 
-08/31/98 

- 09/30/98 

- 09/30/00 

- 06/30/99 
-08/31/03 

- 06/30/03 

- 12/31/04 

- 12/31/02 
-01/31/03 
■01/31/02 
■08/31/05 
03/31/04 
06/30/10 



Protein modeling by satisfaction of spatial restraints. NIH/NIGMS R29 
GM 54762. $350,000 in direct costs over the course of the grant period. 
Sinsheimer Scholar Award from Alexandrine and Alexander L. Sinsheimer 
Fund. $120,000. 

Acquisition of a multiprocessor computer for computational physics and 
structural biology. NSF BIR-9601845. $289,000. 
Alfred P. Sloan Fellowship. $40,000. 
Gift. Molecular Simulations Inc. $8,500. 

Disruption and expression of mast cell protease genes. NIH/NHLBI R01 
HL63284-02 (R. Stevens, PI; A. Sali, consultant). $145,000. 
Gift. Prospect Genomics Inc. $180,000. 

Career Scientist Award from Irma T. Hirschl Monique Weill-Caulier Trust 
$125,000. 

Comparative annotation of eukaryotic genomes: From gene detection to 
protein structure modeling. Mathers Foundation Award. $270,000. 
Target selection for the structural genomics of cancer. NIH/NCI R33 
CA84699 (T. Gaasterland, PI; A. Sali, Co-PI). $270,000. 
Database of comparative protein structure models for genomics. The 
Merck Genome Research Institute. $260,000. 

Center for Structural genomics. NIH/NIGMS P50 GM62529 (S.K. Burley, 
PI; A. Sali, co-PI). $850,000. 

Comprehensive map of cellular protein interactions. NIH/NCI R33 
CA89810 (B. Chait, PI; A. Sali, consultant). $170,000. 
Protein modeling by satisfaction of spatial restraints. NIH/NIGMS R01 
GM 54762. $1,400,000. 

Development of an integrated software environment for high-throughput 



structural biology and automated comparative protein structure modeling. 
Sun Academic Equipment Grant. EDUD-7824-020257-US, $294,185. 
07/01/03 - 06/30/06 3D~reconstruction and identification of postsynaptic molecular complexes 

images by electron cryotomography. Human Frontier Science Program 
Research Grant. RGP67/2003. (W. Baumeister, PI; A. Sali, Co- 
Investigator). $225,000. 

04/01/03-3/31/10 Pharmacogenetics of Membrane Transporters. U01 GM61390 (K. 

Giacomini, PI; A. Sali, collaborator). $550,000. 
08/01/03-07/31/05 Towards a comprehensive map of protein-ligand interactions. California 

Institute for Quantitative Biomedical Research. $120,000. 
08/03/03 IBM SUR award. Large-scale protein structure modeling and ligand 

docking (A. Sali, PI). $300,000. 
09/ 1 5/03 Intel Inc. Award. Large-scale protein structure modeling and ligand docking 

(A. Sali, PI), $70,000. 

1 1/01/03 - 10/3 1/06 Subnanometer structure based fold determination of biological complexes. 

NSF E1A-0324645 (W. Chiu, Bajaj, A. Sali, Pis), $520,000. 
02/15/04-02/14/06 Hierarchical framework for structural biology. Award from the Sandler 

Program in Basics Sciences. $150,000. 
04/01/04 - 03/31/09 Targeting Cysteine Proteases- Antiparasitic Chemotherapy. NIH P01 

A135707 (J. McKerrow, PI; A. Sali, Co-PI). $140,000. 
07/01/04-06/31/09 Deciphering Eznyme Specificity. NIH P01 GM71790 (J. Gerlt, PI; A. 

Sali, Co-PI). $400,000. 

09/01/04-08/31/05 Structure-based discovery of Hsp90 inhibitors. UC Discovery Grant 

bio03-10401 (D. Agard, PI, A. Sali, PI). $63,960. 
09/01/04-08/31/08 Cataloguing the Human Salivary Proteome. NIH U01 DEO 16274 (S. 

Fisher, PI; A. Sali, Co-PI). $200,000. 
07/01/05-06/30/10 Specialized Center for the Protein Structure Initiative. NIH U54 

GM074929-01 (B. Stroud, PI; A. Sali, Co-PI). $360,000. 
09/01/05 - 08/3 1/10 The Nuclear Information Pathway. NIH U54 RR022220 (M. Rout, PI; A. 

Sali, Co-PI). $1,259,214. 
09/01/05-08/31/10 NYSGXRC: A Large Scale Center for the Protein Structure Initiative. 

NIH/NIGMS U54 GM62529 (S.K. Burley, PI; A, Sali, Co-PI). 

$1,600,000. 

09/30/05 ~~ 09/29/10 Center for Protein Folding Machinery. NIH PN2 EY016525-02 (W. Chiu, 

PI; A. Sali Co-PI). $200,000. 



FELLOWSHIPS TO LAB MEMBERS 



Howard Hughes Predoctoral Fellowship (R. Sanchez) 
Howard Hughes Predoctoral Fellowship (F. Davis) 



Burroughs Wellcome Fund Predoctoral Fellowship (R. Chiang) 
Alfred P. Sloan Postdoctoral Fellowship (A. Stuart) 

Burroughs Wellcome Fund Postdoctoral Fellowship (A. Fiser, M. A. Marti-Renom) 
Rockefeller University Presidential Fellowship (M.A. Marti-Renom) 
Charles Revson Foundation Postdoctoral Fellowship (A. Fiser, M.S. Madhusudhan) 
NIH Postdoctoral Fellowship (R. Karchin) 

Burroughs Wellcome Predoctoral Fellowship (M. Kim, R. Chiang) 
DOE Predoctoral Fellowship (M. Peterson) 
Genentech Award (M. Peterson) 

Human Frontier Sciences Program Postdoctoral Fellowship (F. Foerster) 



PUBLICATIONS 

1. M. Renko, A. Sali, V. Turk, M. Pokorny, I. Kregar. "A neutral metalloproteinase from 
Streptomyces rimosus" Vestnik Slovenskega Kemijskega Drustva 32/2, 161-173, 1985. 

2. B. Lenarcic, A. Ritonja, A. Sali, M. Kotnik, V. Turk, W. Machleidt. "Properties and structure 
of human spleen stefin B - a low molecular weight protein inhibitor of cysteine proteinases" In: 
Cysteine proteinases and their inhibitors. Ed: V. Turk. Walter de Gruyter, Berlin, 473-487, 1986. 

3. V. Turk, J. Brzin, B. Lenarcic, A. Sali, W. Machleidt. "Human stefins and cystatins: Their 
properties and structural relationships" In: Cysteine proteinases and their inhibitors. Ed: V. Turk. 
Walter de Gruyter, Berlin, 429-441, 1986. 

4. M. Kotnik, A. Sali, J. Kos, B. Turk, V. Turk. "Nova metoda za hitro dolocanje kineticnih 
konstant pri interakciji encima s korapetitivnim inhibitorjera (A new method for rapid 
determination of kinetic constants for competitive inhibition of enzymes)" Vestnik Slovenskega 
Kemijskega Drustva 34, 369-377, 1987. 

5. A. Sali, V. Turk. "Prediction of the secondary structure of stefins and cystatins, the low Mr 
protein inhibitors of cysteine proteinases" Biological Chemistry Hoppe-Seyler 368, 493-499, 
1987. 

6. T. Lah, I. Kregar, A. Sali, B. Lenarcic, M. Kotnik, V. Kostka, V. Turk. "Circular dichroism 
studies of different aspartyl proteinases and their interactions with pepstatin" Periodicum 
Biologorum 90, 31-38, 1988. 

7. V. Turk, R. Jerala, B. Lenarcic, A. Sali. "Structural and functional aspects of human cathepsins 
B" In: Intracelullar Proteolysis: Mechanisms and Regulations. Eds: N. Katunuma and E. 
Kominami. Japan Scientific Societies Press, Tokyo , 27-37, 1989. 



8. A. Sali, B. Veerapandian, l.B. Cooper, S.I. Foundling, D.J. Hoover, T.L. Bhrndell. "High- 
resolution X-ray diffraction study of the complex between endothiapepsin and an oligopeptide 
inhibitor: The analysis of the inhibitor binding and description of the rigid body shift in the 
enzyme EMBO Journal 8, 2 1 79-2 1 88, 1 989. 

9. T.L, Blundell, G. Elliot, S.P. Gardner, T. Hubbard, S. Islam, M. Johnson, D. Mantafounis, P. 
Murray-Rust, J. Overington, J.E. Pitts, A.Sali, BX. Sibanda, J. Singh, MJ.E. Sternberg, M.J. 
Sutcliffe, J.M. Thornton, P. Travers. "Protein engineering and design" Philosophical Transactions 
of the Royal Society hand. B 324, 447-460, 1989. 

10. T.L. Blundell, D. Carney, T. Hubbard, M.S. Johnson, A. McLeod, J.P. Overington, A. Sali, 
M.S. Sutcliffe, P. Thomas. ! 'Knowledge-based protein modelling and design" In: Advances in 
Protein Design: International Workshop 1988 GBF Monographs 12. Eds: H. Bloecker and J. 
Collins and R. D. Schmid and D. Schomburg. VCH Braunschweig, 39-43, 1989. 

11. A. Sali, T.L. Blundell. "Definition of general topological equivalence in protein structures: A 
procedure involving comparison of properties and relationships through simulated annealing and 
dynamic programming" Journal of Molecular Biology 212, 403-428, 1990. 

12. A. Sali, J.P. Overington, M.S. Johnson, T.L. Blundell. "From comparisons of protein 
sequences and structures to protein modelling and design" Trends in Biochemical Sciences 15, 
235-240, 1990. 

13. B. Veerapandian, J.B. Cooper, A. Sali, T.L. Blundell. "X-ray analyses of aspartic proteinases 
III. Three-dimensional structure of endothiapepsin complexed with a transition-state isostere 
inhibitor of renin at 1. 6 A resolution" Journal of Molecular Biology 216, 1017-1029, 1990. 

14. J, Overington, M.S. Johnson, A. Sali, T.L. Blundell. "Tertiary structural constraints on 
protein evolutionary diversity; Templates, key residues and structure prediction" Proceedings of 
the Royal Society Lond. B 241, 132-145, 1990. 

15. J.P. Overington, M.S. Johnson, C. Topham, A. McLeod, A. Sali, Z.-Y. Zhu, L. Sibanda, T.L. 
Blundell. "Applications of environment specific amino acid substitution tables to identification of 
key residues in protein tertiary structure" Current Science 59, 867-874, 1990. 

16. M.S. Johnson, A. Sali, T.L. Blundell. "Phylogenetic relationships from three-dimensional 
protein structures" Methods in Enzymology 183, 670-690, 1990. 

17. M.S. Johnson, J.P. Overington, A. Sali. "Knowledge-based protein modelling: Human plasma 
kallikrein and human neutrophil defensin" In: Chemistry: Techniques Structure and Function. Ed: 
J. J. Villafxanca. Academic Press, Inc. San Diego, USA, 567-574, 1990. 



18. M.S. Johnson, J. Overington, A. Sali, Z. Zhu, D. Donnelly, P. Thomas, A. McLeod, R. 
Gooid, C. Topham, T.L. BIundelL "From comparative structure analysis to protein engineering: 
Knowledge-based protein modelling and design" Fresnius Journal of Analytical Chemistry 337, 1- 
3, 1990. 

19. T.L. Blundell, M.S. Johnson, J.P. Overington, A. Sali. "Knowledge-based protein modeling 
and the design of novel molecules" In: Protein design and the development of new therapeutics 
and vaccines. Eds: J. B. Hook and G. Poste. Plenum Press New York, 209-227, 1990. 

20. T.L. Blundell, J.B. Cooper, A. Sali, Z.-Y. Zhu. "Comparisons of the sequences, 3-D 
structures and mechanism of pepsin-like and retroviral aspartic proteinases" In: Structure and 
Function of the Aspartic Proteinases: Genetics, Structures, and Mechanisms. Plenum Publishing 
Company New York, 443-453, 1991. 

21. T.L. Blundell, J.B. Cooper, D. Donnelly, H. Driessen, Y. Edwards, F. Eisenmenger, C. 
Frazao, M. Johnson, K. Niefind, M.Newman, J. Overington, A. Sali, C. Slingsby, V. Nalini, Z- 
Y. Zhu. "Patterns of sequence variation in families of homologous proteins" In: Methods in 
Protein Sequence Analysis. Eds: Joraval, H66g and Gustavsson. Birkhauser Verlag, Basel , 373- 
385, 1991. 

22. A. Sali, J.P. Overington, M.S. Johnson, T.L. Blundell. "From modelling homologous proteins 
to prediction of structure" In: Protein design and the development of new therapeutics and 
vaccines. Eds: J. M. Goodfellow and D. S. Moss. Ellis Horwood Ltd. New York , 231-245, 
1991. 

23. A. Sali, B. Veerapandian, J.B. Cooper, D.S. Moss, T. Hofmann, T.L. BIundelL "Domain 
flexibility in aspartic proteinases" Proteins 12,158-170, 1992. 

24. B. Veerapandian, J.B. Cooper, A. Sali, T.L. Blundell, RX. Rosati, B.W. Dorainy, D.B. 
Damon, DJ. Hoover. "Direct observation by X-ray analysis of a tetrahedral intermediate of 
aspartic proteinases" Protein Science 1, 322-328, 1992. 

25. Z.-Y. Zhu, A. Sali, T.L. BIundelL "A variable gap penalty function and feature weights for 
protein 3-D structure comparisons" Protein Engineering 5, 43-51, 1992. 

26. J. Overington, D. Donnelly, M.S. Johnson, A. Sali, T.L. Blundell. "Environment-specific 
amino acid substitution tables: Tertiary templates and prediction of protein folds" Protein 
Science 1,216-226, 1992. 

27. M.S. Johnson, J.P. Overington, A. Sali, T.L. Blundell. "From the comparative analysis of 
proteins to similarity-based modelling" In: Computer Modelling of Biomolecular Processes. Eds: 
V. A. Ratner and N. A. Kolchanov. Nova Science Publishers, 6080 Jericho Turnpike, 191-196, 



1992, 



28. J.P. Overington, Z,-Y. Zhu, A, Sali, M.S. Johnson, R. Sowdhamini, G,V. Louie, T.L. 
BhmdelL "Molecular recognition in protein families: A database of aligned three™ dimensional 
structures of related proteins" Biochemical Society Transactions 21, 597-604, 1993, 

29. A. Sali, T.L. Blundell. "Comparative protein modelling by satisfaction of spatial restraints" 
Journal of Molecular Biology 134, 779-815, 1993, 

30. A. Sali, R. Matsumoto, H.P. McNeil, M. Karplus, RX. Stevens. "Three-dimensional models 
of four mouse mast cell chymases. Identification of proteoglycan-binding regions and protease- 
specific antigenic epitopes" Journal of Biological Chemistry 268 ? 9023-9034, 1993. 

SLA. Sali, T.L. Blundell "Comparative protein modelling by satisfaction of spatial restraints" 
In: Protein Structure by Distance Analysis. In: Protein Structure by Distance Analysis. Eds: H. 
Bohr and S. Brunak. IOS Press, Amsterdam, 64-86, 1994. 

32. A. Sali, J.P Overingtom "Derivation of rules for comparative protein modeling from a 
database of protein structure alignments" Protein Science 3, 1582-1596, 1994. 

33. A. Sali, EJu Shakhnovich, M. Karplus, "Kinetics of protein folding: A lattice model study of 
the requirements for folding to the native state" Journal of Molecular Biology 235, 1614*1636, 
1994. 

34. A. Sali, E.I. Shakhnovich, M. Karplus. "How does a protein fold?' 1 Nature 369, 248-251, 
1994. 

35. A. Dinner, A, Sali, M Karplus, E. Shakhnovich. "Phase diagram of a model protein derived 
by exhaustive enumeration of the conformations " Journal of Chemical Physics 101, 1444-1451, 
1994. 

36. M. Karplus, A. Sali. "Theoretical studies of protein folding and unfolding" Current Opinion 
in Structural Biology 5, 58-73 ? 1995, 

37. M. Karplus, A. Caflisch, A, Sali, E. Shakhnovich. "Protein dynamics: From the native to the 
unfolded state and back again" In: Modelling of Biomolecular Structures and Mechanisms. Eds: 
A. Pullman et al Kluwer Academic Publishers, Dordrecht, Netherlands, 69-84, 1995. 

38. A, Sali. "MODELER: Implementing 3D protein modeling' 1 mc 2 Molecular Simulations Inc. 
Burlington, MA ? 2, 5, 1995. 

39. X. Wu, B. Knudsen, S.M. Feller, J. Zheng, A. Sali, D. Cowburn, H. Hanafusa, J. Kuriyan. 
"Structural basis for the specific interaction of lysihe-containing proline-rich peptides with the 



ammo-terminal SH3 domain of c-Crk" Structure 15, 215-226, 1995. 

40. R. Matsumoto, A. Sali, N. Ghildyal, M. Karplus, R.L. Stevens. "Packing oi 
proteoglycans in the granules of mast cells and other hematopoietic cells. A cluster of totidines 
L mouse mast cell protease-7 regulates its binding to heparin serglycm proteoglycan" Journal of 
Biological Chemistry 270, 19524-19531, 1995. 

41. A. Sali. "Comparative protein modeling by satisfaction of spatial restraints" Molecular 
Medicine Today 1, 270-277, 1995. 

42. A. Sali,E. Shakhnovich, M. Karplus. "Protein Folding Studied by ^onte Ca^ Sim^tions- 
In, Protein Folds: A Distance Based Approach. Eds: H. Bohr and S. Brunak. CRC Press Inc., 
Florida, USA, 202-216, 1995. 

43 A Sali E Shakhnovich, M. Karplus. "Thermodynamics and kinetics of protein folding from 
lattice Monte Carlo simulations" In: DIMACS Series in Discrete Mathematics and Theoretical 
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RESEARCH PROGRAM (SEPARATE SUMMARY) 

The following five articles represent much of the research of the Sali group. 
1 NEW METHODS FOR COMPARATIVE MODELING 

B. John, A. Sali. "Comparative protein structure modeling by iterative alignment, model 
building! and model assessment" Nucleic Acids Research 31, 1982-1992, 2003. 

In the absence of an experimentally determined protein structure, homology or comparative 
protein structure modeling can often provide a useful 3D model of a given protein based on its 
similarity to a known structure. To increase the impact of comparative modeling on biology, we 
are maximizing its accuracy and applicability to as many proteins as possible. This is achieved 
by predicting and minimizing mistakes in fold assignment and sequence-structure alignment as 
welt as errors in the modeling of sidechains, loops, and core regions. As any other prediction, the 
accuracy of comparative modeling is limited by the information that can be used in model 
calculations (eg, known related structures, energy functions). Therefore, to provide us with a 
flexible framework for continued development, we previously introduced comparative modeling 
by satisfaction of spatial restraints (Sali and Bmndell, J. Mol. Biol. 234, 779-815 1993). This 
method computes models based on both a physical molecular mechanics force field as well as 
statistical preferences extracted from known protein structures and allows us to explore varied 
protein representations, scoring functions, and optimization protocols. 

In the present paper, we developed and benchmarked an approach to minimize errors in the 
alignment of the target sequence against the template structure. Alignment errors are the single 
most important limitation on comparative modeling because they are frequent and have a large 
impact on the model accuracy. Unfortunately, no current comparative model building method 
can generally recover from errors in the input alignment. To ameliorate this problem, we 
developed an automated method that optimizes both the alignment and the model implied by it. 
This task is achieved within the framework of modeling by satisfaction of spatial restraints with 
the aid of a genetic algorithm protocol that starts with a set of initial alignments, and then iterates 
through re-alignment, model building, and model assessment to optimize a model assessment 
score The average model accuracy increased from 43 to 54% (the model accuracy was measured 
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3. PROTEIN FOLDING . 

CM. Dobson, A. Sali, M. Karphis. "Protein folding; A perspective from theory and experiment 

Ang'chemJnt 37, 868-893, 1998. 

As the number of possible conformations for a polypeptide chain is astronomically large, a 
systematic search for the native structure would require an almost infinite length of time. To 
solve this apparent paradox, the "old view" of protein folding suggested a simple sequential 
model, which postulates a unique folding pathway with defined and sequential intermediates. 
In this article we synthesized evidence from both experiment and theory about the so-called 
"new view" of protein folding, which suggests that neither the folding pathway nor the set of 
folding intermediates is unique. Instead, the folding process is heteregenous resulting from a 
complex interplay between entropic and enthalpic contributions to the free energy of the system 
during the reaction. This novel perspective had a strong impact on both experimentalists and 
theoreticians interested in the protein folding problem. The term "new view" was introduced in 
the News & Views article (R.L. Baldwin. Nature 369, 183, 1994) that accompanied our original 
paper on Monte Carlo simulations of a simple lattice model of protein folding (Sail et at, Nature 
369 248-251 1994). In that publication, we determined the difference between folding and non- 
folding sequences for a simple 3D lattice model of a protein chain. Specifically, we generated 
many random sequences, tested them for their ability to fold to a stable native state, and finally 
compared the folding and non-folding sequences in terms of their structural and energetic 
properties The necessary and sufficient condition for a sequence to fold rapidly was a 
pronounced energy minimum for the native state. This finding was explained by an elementary, 
three-stage random search mechanism of folding, which did not involve a pathway (thus the 
label "new view"). 

These Monte Carlo simulations, together with theoretical models and experimental results, 
provide the support for the current "funnel"-like view of the protein foldmg landscape. The 
understanding of folding is important for the analysis of many events involved m cellular 
regulation the design of proteins with novel functions, the utilization of sequence information 
from the various genome projects, and the development of novel therapeutic strategies for 
treating or preventing human diseases that are associated with the failure of proteins to fold 
correctly. 

4 APPLICATIONS OF BIOINFORMATICS TO SPECIFIC BIOLOGICAL SYSTEMS 
D Deyos, S. Dokudovskava, F. Alber , R. Williams , B.T. Chait, A. Sali , M.P. Rout . 
"Components of Coated Vesicles and Nuclear Pore Complexes Share a Common Molecular 
Architecture" PLOS Biology 2, 1-9, 2004. 

It is essential for developers of computational methods to collaborate with experimentalists in 
applying their methods to practical problems. Such applications provide a validation of the 
methods as well as feedback for further development. Over the last decade, we collaborated with 
numerous biologists, producing 33 papers focused on structural and functional characterization 
of 22 specific proteins and protein families (see cv), with 7 new collaborations m progress. We 
illustrate this effort by one specific application, an assessment of the predicted folds of 
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